Reception device, information processing method in reception device, transmission device, information processing device, and information processing method

ABSTRACT

It is possible to access a presentation control information (HTML document) element contained in a transport media stream from an HTML application side. 
     A first HTML document for displaying a web page is received. In addition, the transport media stream which contains a predetermined number of transport media and a second HTML document as the presentation control information of the transport media is received. A DOM tree of the first HTML document is generated. When a video element that references the transport media stream is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath a node of the video element.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.14/763,689, filed Jul. 27, 2015, which is a national stage ofInternational Application No. PCT/JP2014/060874, filed on Apr. 16, 2014,the entire contents of which are incorporated herein by reference, andclaims priority under 35 U.S.C. 119 to Japanese Application No.2013-093436, filed Apr. 26, 2013.

TECHNICAL FIELD

The present technology relates to a reception device, an informationprocessing method in the reception device, a transmission device, aninformation processing device, and an information processing method.Specifically, the present technology relates to a reception device orthe like which receives and processes display control data such as anHTML document for displaying a web page.

BACKGROUND ART

Display of the web page is performed by a web browser. At this time, theweb browser acquires an HTML document (an HTML file) from a web server,parses the HTML document to generate a Document Object Model (DOM) tree,generates various rendered elements based on the DOM tree and displaysthe web page (for example, refer to PTL 1).

The Document Object Model (DOM) is present in the World Wide WebConsortium (W3C) standards. The DOM defines a tree structure in whichthe outermost tag, <html>, is the top node in relation to one HTMLdocument, and designates an interface for applying dynamic processing byJavaScript to each parameter of tags. Note that “JavaScript” is aregistered trademark.

In recent years, a method defined in MPEG Media Transport (MMT) ISO/IEC23008-1 is attracting attention as a transport method suitable for nextgeneration broadcasting. MMT not only defines the transport layer, butalso a data structure referred to as MMT Composition Information(MMT-CI) which describes the configuration of the screen or a changewith time. The MMT-CI configures presentation control information oftransport media such as video, audio, and images. The MMT-CI is definedby HTML 5.

CITATION LIST Patent Literature

PTL 1: Japanese Unexamined Patent Application Publication No.2011-065489

SUMMARY OF INVENTION Technical Problem

There is a case in which a video (Video) element is present in an HTML 5document from a web server. In this case, video display is performedaccording to the video element in a predetermined region of a web pagescreen that is displayed by a web browser. In this case, it isconceivable to reference an MMT transport media stream as a source ofthe video element.

The object of the present technology is to enable the access to apresentation control information (HTML document) element contained inthe transport media stream from the HTML application side.

Solution to Problem

The concept of the present technology is a reception device whichincludes

a first reception unit which receives a first HTML document fordisplaying a web page;

a second reception unit which receives a transport media streamcontaining a predetermined number of transport media and a second HTMLdocument as presentation control information of the transport media; and

a DOM tree generation unit which generates a DOM tree of the first HTMLdocument that is received by the first reception unit,

in which, when a video element that references the transport mediastream is present in the first HTML document, the DOM tree generationunit links the DOM tree that is generated according to the second HTMLdocument contained in the transport media stream beneath a node of thevideo element.

In the present technology, the first HTML document for displaying theweb page is received by the first reception unit. In addition, thetransport media stream is received by the second reception unit. Thetransport media stream contains the predetermined number of transportmedia and the second HTML document as the presentation controlinformation of the transport media.

For example, the transport media stream may be a transport stream inwhich first transport packets containing a payload of the transportmedia, and second transport packets containing information relating tothe transport media are time division multiplexed. In this case, thetransport packets may be MMT packets, and the second HTML document maybe an MMT-CI. For example, the second HTML document may be data havingan HTML structure to which the data structure of MPEG2-TS is mapped.

The DOM tree of the first HTML document which is received by the firstreception unit is generated by the DOM tree generation unit. When avideo element that references the transport media stream is present inthe first HTML document, the DOM tree that is generated according to thesecond HTML document contained in the transport media stream is linkedbeneath the node of the video element.

In the present technology, when generating the DOM tree of the firstHTML document, the DOM tree that is generated according to the secondHTML document is linked beneath the node of the video element.Therefore, it is possible to access a presentation control information(HTML document) element contained in the transport media stream from theHTML application side.

Note that, in the present technology, for example, an elementacquisition unit which acquires a predetermined element of the secondHTML document based on the DOM tree that is generated by the DOM treegeneration unit may be further provided. In this case, for example, itis possible to acquire and use a predetermined element of the secondHTML document in the HTML application side.

For example, a display control unit which controls display of the webpage based on the DOM tree that is generated by the DOM tree generationunit may be further provided, and the display control unit may displayinformation relating to presentation control of the predetermined numberof transport media on a display screen of the web page based on thepredetermined element of the second HTML document that is acquired bythe element acquisition unit. In this case, for example, the user canascertain the information relating to the presentation control of thepredetermined number of transport media presented on the display screenof the web page.

In addition, for example, a program for accessing a specific element ofthe second HTML document may be contained in the first HTML document,and the element acquisition unit may acquire the specific element of thesecond HTML document based on the program. In this case, for example, itis possible to easily acquire a predetermined element of the second HTMLdocument in the HTML application side.

In addition, for example, reference information for acquiring a programfor accessing a specific element of the second HTML document may becontained in the first HTML document, and the element acquisition unitmay acquire the specific element of the second HTML document based onthe program that is acquired using the reference information. In thiscase, for example, it is possible to easily acquire a predeterminedelement of the second HTML document in the HTML application side.

In addition, another concept of the present technology is a transmissiondevice which includes

a holding portion which holds a first HTML document for displaying a webpage which contains a video element that references a transport mediastream containing a predetermined number of transport media and a secondHTML document as presentation control information of the transportmedia, and which contains a program for accessing a specific element ofthe second HTML document or reference information for acquiring theprogram; and

a transmission unit which transmits the first HTML document that isheld.

In the present technology, the first HTML document for displaying a webpage is held in the holding portion. The video element that referencesthe transport media stream which contains the predetermined number oftransport media and the second HTML document as the presentation controlinformation of the transport media is contained in the first HTMLdocument. In addition, the first HTML document contains a program foraccessing a specific element of the second HTML document or referenceinformation for acquiring the program. The first HTML document that isheld is transmitted by the transmission unit.

For example, the transport media stream may be a transport stream inwhich first transport packets containing a payload of the transportmedia, and second transport packets containing information relating tothe transport media are time division multiplexed. In this case, thetransport packets may be MMT packets, and the second HTML document maybe an MMT-CI.

In the present technology, the first HTML document for displaying a webpage that is transmitted from the transmission unit contains a programfor accessing a specific element of the second HTML document containedin the transport media stream or reference information for acquiring theprogram. Therefore, it is possible to easily acquire a predeterminedelement of the second HTML document in the HTML application side in thereception unit.

In addition, another concept of the present technology is an informationprocessing device which includes

a data acquisition unit which acquires a first HTML document fordisplaying a web page; and

a DOM tree generation unit which parses the first HTML document that isacquired by the data acquisition unit and generates a DOM tree in whicha plurality of elements are associated with each other,

in which, when a video element that references a transport media streamcontaining a predetermined number of transport media and a second HTMLdocument as presentation control information of the transport media ispresent in the first HTML document, the DOM tree generation unit linksthe DOM tree that is generated according to the second HTML documentcontained in the transport media stream beneath the video element.

In the present technology, the first HTML document for displaying a webpage is acquired by the data acquisition unit. The first HTML documentthat is acquired by the data acquisition unit is parsed, and a DOM treein which a plurality of elements are associated with each other isgenerated by the DOM tree generation unit.

In this case, when a video element that references the transport mediastream that contains a predetermined number of transport media and thesecond HTML document as the presentation control information of thetransport media is present in the first HTML document, the DOM tree thatis generated according to the second HTML document contained in thetransport media stream is linked beneath the video element.

In the present technology, the DOM tree that is generated according tothe second HTML document is linked beneath the node of the video elementof the DOM tree of the first HTML document. Therefore, it is possible toaccess a presentation control information (HTML document) elementcontained in a transport media stream from the HTML application side.

In addition, another concept of the present technology is an informationprocessing device which includes

a data acquisition unit which acquires first display control datacontaining a plurality of structural units that define informationrelating to display control; and

a structured data generation unit which parses the first display controldata that is acquired by the data acquisition unit and generatesstructured data in which the plurality of structural units areassociated with each other,

in which, when a predetermined structural unit that references atransport media stream containing a predetermined number of transportmedia and second display control data containing a plurality of thestructural units that define presentation control information of thetransport media is present in the first display control data, thestructured data generation unit links the structured data that isgenerated according to the second display control data contained in thetransport media stream beneath the predetermined structural unit.

Advantageous Effects of Invention

According to the present technology, for example, it is possible toaccess the content of the presentation control information (HTMLdocument) contained in the transport media stream from the HTMLapplication side. Note that, the effects disclosed in the presentspecification are merely examples, embodiments are not to be limitedthereto and additional effects may be present.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram illustrating a configuration example of adisplay system as an embodiment.

FIG. 2 is a diagram illustrating the basic structure of an MMT-CI.

FIG. 3 is a diagram illustrating the relationship between a browserscreen and an MMT screen when a video (Video) element that references anMMT stream is present in an HTML 5 document.

FIG. 4 is a flowchart illustrating an example of a DOM tree generationprocess in an HTML DOM processing unit of a web browser.

FIG. 5 is a diagram illustrating an example of the mutual relationshipbetween the DOM tree of an HTML document and the DOM tree of an MMT-CIlinked thereto.

FIG. 6 is a diagram illustrating an example of the mutual relationshipbetween the DOM tree of an HTML document and the DOM trees of twoMMT-CIs linked thereto.

FIG. 7 is a diagram illustrating an example of an MMT-CI.

FIG. 8 is a diagram illustrating an example of the DOM tree of anMMT-CI.

FIG. 9 is a diagram illustrating an example of an HTML document.

FIG. 10 is a diagram illustrating an example the DOM tree of an HTMLdocument.

FIG. 11 is a diagram illustrating an example in which the DOM tree of anMMT-CI is linked beneath a video element of the DOM tree of an HTMLdocument.

FIG. 12 is a diagram illustrating an example of a browser screen inwhich an MMT screen is being displayed in a partial region.

FIGS. 13A, 13B are diagrams illustrating an example of an HTML documentcontaining reference information of a program (a script) for accessing aspecific element of an MMT-CI, and an example of a script filecontaining the program (the script) for accessing a specific element ofthe MMT-CI.

DESCRIPTION OF EMBODIMENTS

Hereafter, description will be given of embodiments for realizing theinvention (below, “embodiments”). Note that, the description will begiven in the following order.

1. Embodiment 2. Modification Example 1. Embodiment [ConfigurationExample of Display System]

FIG. 1 illustrates a configuration example of a display system 10. Inthe display system 10, a broadcasting station 110, a streaming server120, and a web (Web) server 130 are disposed on a transmission side, anda receiver 200 is disposed on a reception side.

The broadcasting station 110 generates transport packets of the MMTstructure (refer to ISO/IEC CD 23008-1), that is, a transport mediastream in which MMT packets are contained, and transmits the transportmedia stream to the reception side through an RF transmission channel.Hereinafter, the transport media stream will be referred to as the “MMTstream”, as appropriate. In this case, the broadcasting station 110 RFmodulates the MMT stream via an appropriate application layer or thelike and subsequently transmits the MMT stream to the reception sidethrough the RF transmission channel.

The streaming server 120 transmits an MMT stream which is the same asthat handled by the broadcasting station 110 described above to thereception side through, for example, a communication networktransmission channel such as an Internet 300. In this case, thestreaming server 120 converts the MMT stream into IP packets andtransmits the IP packets to the reception side through the communicationnetwork transmission channel.

First MMT packets containing a payload of transport media such as videoand audio, and second MMT packets containing a payload of informationrelating to the transport media are time division multiplexed in the MMTstream, at least by the size of fragmented packets. A data structurereferred to as MMT Composition Information (MMT-CI) which describes theconfiguration of the screen or a change with time is defined as one typeof information relating to the transport media. The MMT-CI configurespresentation control information of transport media such as video,audio, and images. The MMT-CI is defined by HTML 5.

FIG. 2 illustrates the basic structure of an MMT-CI. A root element is“html”, and has a head element and a body element. The head element hasa plurality of view elements in addition to a title element, and each ofthe view elements has a plurality of divLocation elements. In addition,the body element has a plurality of div elements. It is possible toplace video (video), audio (audio), image (img), and the like inside thediv elements. A source element or a src attribute is used for specifyingeach medium. The view element determines the display position of a divof the body; however, the initial position is determined by thedivLocation element therebelow. The divLocation elements sequentiallyindicate the display position that changes in time-series manner.Therefore, the display position changes according to the divLocationelements without the page reloading.

Returning to FIG. 1, the web server 130 holds HTML 5 documents (HTMLfiles) for displaying web pages (an HTML 5 application) in a holdingportion such as storage. In response to a request from the receptionside, the web server 130 transmits an HTML 5 document that is being heldin the holding portion to the reception side through a communicationnetwork transmission channel such as the Internet 300 using an IPtransmission unit. It is well known that the presence of a video (video)element is permitted in HTML 5. For example, there is a case in which avideo element is present in an HTML 5 document that is transmitted fromthe web server 130 to the reception side, and the source of the videoelement is in an MMT stream. In this case, reference information forreferencing an MMT stream is present as the video element.

The receiver 200 includes an IP reception unit 201, a web (Web) browser202, an output unit 203, an IP/RF reception unit 204, an MMT decodingunit 205, and an MMT player 206. The IP/RF reception unit 204 receivesan MMT stream that is transmitted thereto through an RF transmissionchannel from the broadcasting station 110 after the MMT stream is parsedin the application layer. Alternatively, the IP/RF reception unit 204receives an MMT stream that is transmitted thereto from the web server130 through a communication network transmission channel.

The MMT decoding unit 205 subjects the MMT packets contained in the MMTstream that is received by the IP/RF reception unit 204 to unpackettingand a decoding process, obtains data such as video, audio, and images asmedia data, and also obtains meta-data and messages. In this case, theMMT-CI which describes the configuration of the screen or a change withtime is also obtained. The MMT decoding unit 205 also performs thegeneration of the Document Object Model (DOM) tree of the MMT-CI.

The MMT player 206 generates output data of images and audio accordingto the configuration (the layout) of the screen, the change with time orthe like specified by the MMT-CI based on data such as video, audio, andimages obtained by the MMT decoding unit 205.

The output unit 203 performs image display and audio output based on theoutput data of images and audio generated by the MMT player 206, or theoutput data of images and audio generated by the web browser 202. Theoutput unit is configured by a display which performs the image display,a speaker which performs audio output, or the like.

The IP reception unit 201 receives an HTML 5 document (an HTML file)that is transmitted thereto from the web server 130 through acommunication network transmission channel such as the Internet 300. Theweb browser 202 parses the HTML 5 document that is received by the IPreception unit 201, generates a DOM tree, and generates the output dataof images and audio based on the DOM tree and the like.

The web browser 202 includes an HTML parsing unit 202 a which parses theHTML 5 document, and an HTML DOM processing unit 202 b which generates aDOM tree based on the parsed results. In addition, the web browser 202includes an HTML layout processing unit 202 c and an HTML renderingprocess unit 202 d which perform a layout process and a renderingprocess based on the DOM tree or the like and generate the output dataof images and audio.

In the embodiment, when a video (video) element which references an MMTstream is present in the HTML 5 document, the HTML DOM processing unit202 b links the DOM tree of the MMT-CI that is generated by the MMTdecoding unit 205 as described above beneath the node of the videoelement. By performing the linking of the DOM trees in this manner, itbecomes possible to access the elements of the MMT-CI from the HTMLapplication side that is handled by the web browser 202.

FIG. 3 illustrates the relationship between a browser screen (HTML 5)and an MMT screen when a video (video) element that references an MMTstream is present in the HTML 5 document. In this case, the MMT screenenters a state of being inserted into a portion of the browser screen.Here, the size adjustment and the like when inserting the MMT screeninto a portion of the browser screen is performed by the HTML renderingprocess unit 202 d of the web browser 202.

FIG. 4 illustrates an example of a DOM tree generation process in theHTML DOM processing unit 202 b of the web browser 202. The HTML DOMprocessing unit 202 b starts the DOM tree generation process in stepST1, and subsequently sets i=0 in step ST2.

Next, the HTML DOM processing unit 202 b determines whether or not anelement to parse is present in the HTML document in step ST3. When anelement to parse is present, the HTML DOM processing unit 202 b parsesan element of the HTML document for each tag in step ST4.

Next, the HTML DOM processing unit 202 b determines whether or not avideo element is present in the elements of the parsed HTML document instep ST5. When a video element is present in the elements, the HTML DOMprocessing unit 202 b determines whether or not the source is an MMTstream in step ST6.

When the source is an MMT stream, the HTML DOM processing unit 202 blinks the DOM tree of the MMT-CI beneath the node of the video elementin step ST7. In other words, the HTML DOM processing unit 202 b joinsthe document of the DOM tree of the MMT plane, which serves asotherPlane[i], to the child node of the video element.

Next, the HTML DOM processing unit 202 b increments the value of i instep ST8, subsequently returns to the process of step ST3, and repeatsthe same processes as described above. Note that, when no video elementis present in step ST5, and when the source of the video element is notan MMT stream in step ST6, the process immediately returns to theprocess of step ST3. In addition, when all the elements of the HTMLdocument are parsed and there are no elements to parse in step ST3, theHTML DOM processing unit 202 b proceeds to step ST9 and completes theDOM tree generation process.

FIG. 5 illustrates an example of the mutual relationship between the DOMtree of an HTML document and the DOM tree of an MMT-CI linked thereto.Using these DOM trees, it is possible to access each element of the HTMLdocument and each element of the MMT-CI.

For example, it is possible to perform the access to the video elementof the HTML document using a script (script) such as that shown below.

var videoElm=document.getElementsByTagName(‘video’)[0];

Alternatively, if an id is attached to the video element, it is possibleto perform the access to the video element of the HTML document using ascript such as that shown below.

var videoElm=document.getElementByID(“v1”);

In addition, it is possible to perform the access to each element of theMMT-CI, for example, a view element using a script such as that shownbelow. This script is an example of a case in which the view element ofthe MMT-CI is accessed from the video element of the HTML document.

var mmtElm=videoElm.otherPlane[0].document.getElementsByTagName(‘view’)[0];

In addition, relative access to the view element of the MMT-CI from thetop of the HTML document is also conceivable. An example of a script inthis case is shown below.

var mmtElm=document.firstChild.firstChild.childNode [1].otherPlane[0].document.getElementsByTagName(‘view’) [0];

Note that, since the DOM of the MMT-CI is appropriately updated in timeseries, when the DOM of the MMT-CI is re-written dynamically from theHTML application, integrity cannot be maintained. Therefore, it isnecessary to use read only (read only). In other words, all of the DOMaccess from otherPlane onward is set to read only (read only).

FIG. 6 is a diagram illustrating an example of the mutual relationshipbetween the DOM tree of an HTML document and the DOM trees of twoMMT-CIs linked thereto. Using these DOM trees, it is possible to accesseach element of the HTML document and each element of the two MMT-CIs.

For example, it is possible to perform the access to the view element ofthe MMT-CI of the DOM tree (otherPlane[i]) using a script (script) suchas that shown below.

var mmtElm=document.firstChild.firstChild.childNode[1].otherPlane[1].document.getElementsByTagName(‘view’) [0];

FIG. 7 is a diagram illustrating an example of an MMT-CI. The htmlelement has a head element and a body element. The head element has atitle element and a view element. Textual data indicating the title ispresent in the title element. The view element has two divLocationelements. In addition, the body element has two div elements. One divelement has a video element and an audio element, and the other divelement has two img elements. FIG. 8 illustrates an example of the DOMtree of an MMT-CI.

FIG. 9 illustrates an example of an HTML document. The html element hasa head element and a body element. The head element has a title element.Textual data indicating the title is present in the title element. Inaddition, the body element has a div element and a script element. Thediv element has a p element and a video element. Textual data indicating“display start time of image2” is present in the p element. Referenceinformation of the MMT stream “src=“http://sample.mmt”” is present asthe video element.

A script for accessing various elements and acquiring the elements ispresent as the script element. Here, “varvideoElm=document.getElementsByTagName(‘video’)[0];” is a script foraccessing the video element of the HTML document. In addition, “varmmtElm=videoElm.otherPlain[0].document.getElementsByTagName(‘view’)[0];” is a script for accessing the view element of the MMT-CI from thevideo element of the HTML document.

Furthermore, “varendtime=mmtElm.getElememtById(‘Image1’).getAttributeNode(“MMT-CI:end”).nodeValue;”is a script for accessing the image1 element of the MMT-CI and acquiringthe value of “end”. It is possible to acquire “18:00”, which is thevalue of the display end time, using the script.

In addition, “var pelm=document.getElementsByTagName(‘p’)[0];” and“pelm.innerText=pelm.innerText+endtime;” are scripts for accessing the pelement of the MMT-CI and acquiring the textual data indicating “displaystart time of image2”.

FIG. 10 illustrates an example the DOM tree of the HTML document. FIG.11 illustrates an example in which the DOM tree of an MMT-CI is linkedbeneath the video element of the DOM tree of the HTML document.

A simple description will be given of the operations of the displaysystem 10 illustrated in FIG. 1. The MMT stream is transmitted from thebroadcasting station 110 to the reception side through the RFtransmission channel. Alternatively, the MMT stream is transmitted fromthe streaming server 120 to the reception side through a communicationnetwork transmission channel such as the Internet 300.

The MMT stream that is transmitted from the broadcasting station 110 orthe streaming server 120 is received by the IP/RF reception unit 204 ofthe receiver 200. The MMT stream is supplied to the MMT decoding unit205. The MMT decoding unit 205 subjects the MMT packets contained in theMMT stream to unpacketting and a decoding process, obtains data such asvideo, audio, and images as media data, and further obtains to meta-dataand messages.

In addition, the MMT-CI which describes the configuration of the screenor a change with time is also obtained by the MMT decoding unit 205. Inthe MMT decoding unit 205, the parsing of the MMT-CI is performed andthe DOM tree of the MMT-CI is generated. The various data obtained bythe MMT decoding unit 205 is supplied to the MMT player 206.

The MMT player 206 generates output data of images and audio accordingto the configuration (the layout) of the screen, the change with time orthe like specified by the MMT-CI based on data such as video, audio, andimages obtained by the MMT decoding unit 205. The output data of imagesand audio is supplied to the output unit 203. In the output unit 203,when the MMT screen is displayed, image display and audio output areperformed based on the output data of images and audio generated by theMMT player 206.

In addition, in response to a request from the reception side, an HTML 5document (an HTML file) for displaying a web page (an HTML 5application) is transmitted to the reception side from the web server130 through a communication network transmission channel such as theInternet 300. An HTML 5 document (an HTML file) that is transmitted fromthe web server 130 is received by the IP reception unit 201 of thereceiver 200. The HTML 5 document is supplied to the web browser 202.

In the web browser 202, the HTML 5 document that is received by the IPreception unit 201 is parsed, a DOM tree is generated, and output dataof images and audio is generated based on the DOM tree or the like. Inthe web browser 202, when generating the DOM tree, when a video (video)element which references an MMT stream is present in the HTML 5document, the DOM tree of the MMT-CI that is generated by the MMTdecoding unit 205 as described above is linked beneath the node of thevideo element (refer to FIGS. 8, 10, and 11).

The output data of images and audio generated by the web browser 202 issupplied to the output unit 203. In the output unit 203, when thebrowser screen is displayed, image display and audio output areperformed based on the output data of images and audio generated by theweb browser 202. Here, when a video (video) element that references theMMT stream is present in the HTML document, the MMT screen is insertedand displayed in a predetermined region of the browser screen (refer toFIG. 3).

In the web browser 202, accessing a predetermined element of the MMT-CI,acquiring and using the element are performed either according to theoperation of a viewer or automatically based on the linked DOM tree. Forexample, displaying information relating to the presentation control ofthe MMT screen on the browser screen is performed based on the acquiredelement. In the web browser 202, for example, when a script foraccessing a predetermined element of the MMT-CI is contained in the HTMLdocument, acquiring the predetermined element of the MMT-CI is performedusing the script. Note that, depending on the situation, the acquisitionof a predetermined element of the HTML document is also performed.

FIG. 12 illustrates an example of the browser screen. The MMT screen isinserted into a partial region of the browser screen. The display regionof the MMT screen is divided into two, video (video) and audio (audio)are displayed in one region, and image 1 (Image 1) is displayed in theother region. The browser screen displays “Display time of Image 2:18:00” in the proximity of the display region of the MMT screen.

Specific elements of the MMT-CI and the HTML document (the value “18:00”which is the value of the display end time, and the textual dataindicating “Display start time of image2”) that are acquired using ascript contained in the HTML document of FIG. 9 described above areused, for example, for the display information. Based on this display,the viewer of the browser screen can ascertain, in advance, that image 1(Image 1) that is displayed in the display region of the MMT screen willswitch to image 2 (Image 2) when the time reaches 18:00.

As described above, in the display system 10 illustrated in FIG. 1, inthe web browser 202 of the receiver 200, during the generation of theDOM tree of the HTML document, when a video element that references anMMT stream is present in the HTML document, the DOM tree that isgenerated according to the MMT-CI is linked beneath the node of thevideo element.

Therefore, for example, it is possible to access the elements of theMMT-CI from the HTML application side. For example, it is possible todisplay information relating to the presentation control of the MMTscreen on the browser screen (the display screen of web page) based onpredetermined elements of the MMT-CI that are acquired. Accordingly, theuser (the viewer) can ascertain the information relating to thepresentation control of the transport media presented on the browserscreen.

2. Modification Example

Note that, in the embodiment described above, an example is given inwhich a program (a script) for accessing a specific element of theMMT-CI is contained in an HTML document that is transmitted from the webserver 130 (refer to FIG. 9).

However, a configuration can be conceived in which an HTML document thatis transmitted from the web server 130 contains reference informationfor acquiring a program (a script) for accessing a specific element ofthe MMT-CI. In this case, the web browser 202 acquires a script filecontaining a program (a script) for accessing a specific element of theMMT-CI from the web server 130 based on the reference information.

FIG. 13A illustrates an example of an HTML document in this case. Here,““src=”http://sample.mmt”” is reference information for acquiring aprogram (a script) for accessing a specific element of the MMT-CI. Inaddition, FIG. 13B illustrates an example of a script file containingthe program (the script) for accessing a specific element of the MMT-CIin this case.

In addition, in the embodiment described above, an example is given inwhich the transport media stream is an MMT stream; however, the presenttechnology can, naturally, be applied equally to a display system thathandles a transport media stream that is similar to an MMT stream. Inother words, in the embodiment described above, an example is given inwhich the second HTML document is an MMT-CI; however, for example, it isconceivable for this to be data having an HTML structure to which thedata structure of MPEG2-TS is mapped.

In addition, in the embodiment described above, an example is given inwhich the streaming server 120 and the web server 130 are distinct fromeach other; however, a configuration in which these servers are formedof a single server is also conceivable.

In addition, in the embodiment described above, an example is given inwhich, as the receiver 200, the MMT decoding unit 205 and the MMT player206 are present distinctly from the web browser 202. However, it isconceivable to provide a web browser 202 which is provided with thefunctions of the MMT decoding unit 205 and the MMT player 206. In thiscase, the MMT decoding unit 205 and the MMT player 206 are notnecessary.

In addition, the present technology may adopt configurations such as thefollowing.

(1) A reception device including a first reception unit which receives afirst HTML document for displaying a web page; a second reception unitwhich receives a transport media stream containing a predeterminednumber of transport media and a second HTML document as presentationcontrol information of the transport media; and a DOM tree generationunit which generates a DOM tree of the first HTML document that isreceived by the first reception unit, in which, when a video elementthat references the transport media stream is present in the first HTMLdocument, the DOM tree generation unit links the DOM tree that isgenerated according to the second HTML document contained in thetransport media stream beneath a node of the video element.

(2) The reception device according to (1) further including an elementacquisition unit which acquires a predetermined element of the secondHTML document based on the DOM tree that is generated by the DOM treegeneration unit.

(3) The reception device according to (2) further including a displaycontrol unit which controls display of the web page based on the DOMtree that is generated by the DOM tree generation unit, in which thedisplay control unit displays information relating to presentationcontrol of the predetermined number of transport media on a displayscreen of the web page based on the predetermined element of the secondHTML document that is acquired by the element acquisition unit.

(4) The reception device according to (2) or (3), in which a program foraccessing a specific element of the second HTML document is contained inthe first HTML document, and in which the element acquisition unitacquires the specific element of the second HTML document based on theprogram.

(5) The reception device according to (2) or (3), in which referenceinformation for acquiring a program for accessing a specific element ofthe second HTML document is contained in the first HTML document, and inwhich the element acquisition unit acquires the specific element of thesecond HTML document based on the program that is acquired using thereference information.

(6) The reception device according to any one of (1) to (5), in whichthe transport media stream is a transport stream in which firsttransport packets containing a payload of the transport media, andsecond transport packets containing information relating to thetransport media are time division multiplexed.

(7) The reception device according to (6), in which the transportpackets are MMT packets, and in which the second HTML document is anMMT-CI.

(8) An information processing method in a reception device, including afirst reception unit which receives a first HTML document for displayinga web page; and a second reception unit which receives a transport mediastream containing a predetermined number of transport media and a secondHTML document as presentation control information of the transportmedia, the method including a step of generating a DOM tree of the firstHTML document that is received by the first reception unit; and a stepof linking the DOM tree that is generated according to the second HTMLdocument contained in the transport media stream beneath a node of avideo element when the video element that references the transport mediastream is present in the first HTML document.

(9) A transmission device including a holding portion which holds afirst HTML document for displaying a web page which contains a videoelement that references a transport media stream containing apredetermined number of transport media and a second HTML document aspresentation control information of the transport media, and whichcontains a program for accessing a specific element of the second HTMLdocument or reference information for acquiring the program; and atransmission unit which transmits the first HTML document that is held.

(10) The transmission device according to (9), in which the transportmedia stream is a transport stream in which first transport packetscontaining a payload of the transport media, and second transportpackets containing information relating to the transport media are timedivision multiplexed.

(11) The transmission device according to (10), in which the transportpackets are MMT packets, and in which the second HTML document is anMMT-CI.

(12) An information processing device including a data acquisition unitwhich acquires a first HTML document for displaying a web page; and aDOM tree generation unit which parses the first HTML document that isacquired by the data acquisition unit and generates a DOM tree in whicha plurality of elements are associated with each other, in which, when avideo element that references a transport media stream containing apredetermined number of transport media and a second HTML document aspresentation control information of the transport media is present inthe first HTML document, the DOM tree generation unit links the DOM treethat is generated according to the second HTML document contained in thetransport media stream beneath the video element.

(13) An information processing method including a data acquisition stepof causing a web browser to acquire a first HTML document for displayinga web page; and a DOM tree generation step of causing the web browser toparse the first HTML document that is acquired and to generate a DOMtree in which a plurality of elements are associated with each other, inwhich in the DOM tree generation step, when a video element thatreferences a transport media stream containing a predetermined number oftransport media and a second HTML document as presentation controlinformation of the transport media is present in the first HTMLdocument, the DOM tree that is generated according to the second HTMLdocument contained in the transport media stream is linked beneath thevideo element.

(14) An information processing device including a data acquisition unitwhich acquires first display control data containing a plurality ofstructural units that define information relating to display control;and a structured data generation unit which parses the first displaycontrol data that is acquired by the data acquisition unit and generatesstructured data in which the plurality of structural units areassociated with each other, in which, when a predetermined structuralunit that references a transport media stream containing a predeterminednumber of transport media and second display control data containing aplurality of the structural units that define presentation controlinformation of the transport media is present in the first displaycontrol data, the structured data generation unit links the structureddata that is generated according to the second display control datacontained in the transport media stream beneath the predeterminedstructural unit.

(15) An information processing method including a data acquisition stepof acquiring first display control data containing a plurality ofstructural units that define information relating to display control;and a structured data generation step of parsing the first displaycontrol data that is acquired and generating structured data in whichthe plurality of structural units are associated with each other, inwhich, in the structured data generation step, when a predeterminedstructural unit that references a transport media stream containing apredetermined number of transport media and second display control datacontaining a plurality of the structural units that define presentationcontrol information of the transport media is present in the firstdisplay control data, the structured data that is generated according tothe second display control data contained in the transport media streamis linked beneath the predetermined structural unit.

(16) A display system including a first transmission device whichtransmits a first HTML document for displaying a web page; a secondtransmission device which transmits a transport media stream containinga predetermined number of transport media and a second HTML document aspresentation control information of the transport media; and a receptiondevice which includes a first reception unit which receives the firstHTML document that is transmitted from the first transmission device,and a second reception unit which receives the transport media streamwhich is transmitted from the second transmission device, in which thereception device includes a DOM tree generation unit which generates aDOM tree of the first HTML document that is received by the firstreception unit, and a display control unit which controls display of theweb page based on the DOM tree that is generated by the DOM treegeneration unit, and in which, when a video element that references thetransport media stream is present in the first HTML document, the DOMtree generation unit links the DOM tree that is generated according tothe second HTML document contained in the transport media stream beneatha node of the video element.

The main characteristic of the present technology is to enable theaccess to elements of an MMT-CI from an HTML application side by, duringthe generation of a DOM tree of an HTML document (HTML application) fordisplaying a web page, when a video element that references the MMTstream is present, linking the DOM tree of the MMT-CI beneath a node ofthe video element (refer to FIG. 5).

REFERENCE SIGNS LIST

-   -   10 DISPLAY SYSTEM    -   110 BROADCASTING STATION    -   120 STREAMING SERVER    -   130 WEB SERVER    -   200 RECEIVER    -   201 IP RECEPTION UNIT    -   202 WEB BROWSER    -   202 a HTML PARSING UNIT    -   202 b HTML DOM PROCESSING UNIT    -   202 c HTML LAYOUT PROCESSING UNIT    -   202 d HTML RENDERING PROCESS UNIT    -   203 OUTPUT UNIT    -   204 IP/RF RECEPTION UNIT    -   205 MMT DECODING UNIT    -   206 MMT PLAYER    -   300 INTERNET

1. A reception device, comprising: a first reception unit which receivesa first HTML document for displaying a web page; a second reception unitwhich receives a transport media stream containing a predeterminednumber of transport media and a second HTML document as presentationcontrol information of the transport media; and a DOM tree generationunit which generates a DOM tree of the first HTML document that isreceived by the first reception unit, wherein, when a video element thatreferences the transport media stream is present in the first HTMLdocument, the DOM tree generation unit links the DOM tree that isgenerated according to the second HTML document contained in thetransport media stream beneath a node of the video element.
 2. Thereception device according to claim 1, further comprising: an elementacquisition unit which acquires a predetermined element of the secondHTML document based on the DOM tree that is generated by the DOM treegeneration unit.
 3. The reception device according to claim 2, furthercomprising: a display control unit which controls display of the webpage based on the DOM tree that is generated by the DOM tree generationunit, wherein the display control unit displays information relating topresentation control of the predetermined number of transport media on adisplay screen of the web page based on the predetermined element of thesecond HTML document that is acquired by the element acquisition unit.4. The reception device according to claim 2, wherein a program foraccessing a specific element of the second HTML document is contained inthe first HTML document, and wherein the element acquisition unitacquires the specific element of the second HTML document based on theprogram.
 5. The reception device according to claim 2, wherein referenceinformation for acquiring a program for accessing a specific element ofthe second HTML document is contained in the first HTML document, andwherein the element acquisition unit acquires the specific element ofthe second HTML document based on the program that is acquired using thereference information.
 6. The reception device according to claim 1,wherein the transport media stream is a transport stream in which firsttransport packets containing a payload of the transport media, andsecond transport packets containing information relating to thetransport media are time division multiplexed.
 7. The reception deviceaccording to claim 6, wherein the transport packets are MMT packets, andwherein the second HTML document is an MMT-CI.
 8. An informationprocessing method in a reception device including a first reception unitwhich receives a first HTML document for displaying a web page; and asecond reception unit which receives a transport media stream containinga predetermined number of transport media and a second HTML document aspresentation control information of the transport media, the methodcomprising: a step of generating a DOM tree of the first HTML documentthat is received by the first reception unit; and a step of linking theDOM tree that is generated according to the second HTML documentcontained in the transport media stream beneath a node of a videoelement when the video element that references the transport mediastream is present in the first HTML document.
 9. An informationprocessing device, comprising: a data acquisition unit which acquires afirst HTML document for displaying a web page; and a DOM tree generationunit which parses the first HTML document that is acquired by the dataacquisition unit and generates a DOM tree in which a plurality ofelements are associated with each other, wherein, when a video elementthat references a transport media stream containing a predeterminednumber of transport media and a second HTML document as presentationcontrol information of the transport media is present in the first HTMLdocument, the DOM tree generation unit links the DOM tree that isgenerated according to the second HTML document contained in thetransport media stream beneath the video element.